Revealing Predictive Gene Clusters with Supervised Algorithms

Author

  • Marcel Dettling
Abstract

Microarray technology allows the expression levels of thousands of genes to be measured simultaneously and is expected to contribute significantly to advances in fundamental questions of biology and medicine. While microarrays monitor thousands of genes, there is considerable evidence that only a few underlying signature components of gene subsets account for nearly all of the outcome variation. Here, a methodology for revealing these predictive gene clusters in microarray data is presented. For this task, we focus on supervised algorithms, defined as clustering techniques that use external information about the response variables to group the explanatory variables (genes). In studies where external response variables are available, our approach is often more effective than unsupervised techniques such as hierarchical clustering.
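To make the idea of supervised clustering concrete, the following is a minimal sketch, not the method of the paper. It assumes a binary response, a hypothetical score (difference of class means divided by the overall spread), and a simple greedy search; the function names and the toy data are invented for illustration. Genes are added to a cluster only while the cluster's average expression profile separates the two classes better than before.

```python
from statistics import mean, pstdev


def separation(values, labels):
    """Hypothetical score: absolute class-mean difference over overall spread."""
    class0 = [v for v, y in zip(values, labels) if y == 0]
    class1 = [v for v, y in zip(values, labels) if y == 1]
    spread = pstdev(values) or 1.0  # guard against constant profiles
    return abs(mean(class0) - mean(class1)) / spread


def greedy_supervised_cluster(expr, labels, max_size=5):
    """Grow one gene cluster guided by the response labels.

    expr maps a gene name to its expression values, one per sample.
    """
    def score(genes):
        # Cluster profile: per-sample average over the member genes.
        profile = [mean(column) for column in zip(*(expr[g] for g in genes))]
        return separation(profile, labels)

    # Seed with the single gene that best separates the classes.
    cluster = [max(expr, key=lambda g: score([g]))]
    best = score(cluster)
    while len(cluster) < max_size:
        candidates = [g for g in expr if g not in cluster]
        if not candidates:
            break
        gene = max(candidates, key=lambda c: score(cluster + [c]))
        new_score = score(cluster + [gene])
        if new_score <= best:
            break  # no remaining gene improves the separation
        cluster.append(gene)
        best = new_score
    return cluster, best


# Toy data (invented): two informative genes and one uninformative gene.
toy_labels = [0, 0, 0, 1, 1, 1]
toy_expr = {
    "g1": [1.0, 1.2, 0.8, 3.0, 3.1, 2.9],
    "g2": [0.9, 1.1, 1.0, 2.8, 3.2, 3.0],
    "noise": [2.0, 1.0, 3.0, 2.0, 1.0, 3.0],
}
cluster, best = greedy_supervised_cluster(toy_expr, toy_labels)
```

An unsupervised method would group genes by similarity of their profiles alone; here the labels decide which genes join the cluster. The sketch averages raw profiles, so genes that move in opposite directions between the classes would cancel out unless their sign is flipped first.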


Similar resources

Mst-based Semi-supervised Clustering Using M-labeled Objects

Most existing semi-supervised clustering algorithms depend on pairwise constraints, and they usually need a lot of prior knowledge to improve their accuracy. In this paper, we use another semi-supervised method called label propagation to help detect clusters. We propose two new semi-supervised algorithms named K-SSMST and M-SSMST. Both of them aim to discover clusters of diverse densit...

Determination of Best Supervised Classification Algorithm for Land Use Maps using Satellite Images (Case Study: Baft, Kerman Province, Iran)

Given the fundamental goal of remote sensing technology, image classification can be regarded as the most important part of satellite image interpretation. Various algorithms exist for supervised land use classification, and the most pertinent one should be determined. Therefore, this study has been conducted to determine the best and most su...

Machine Learning Techniques for Thyroid Cancer Diagnosis

Drawing inspiration from Alexander’s paper on classification of thyroid cancer, we are interested in replicating and possibly improving the predictive results of a learning model for detecting thyroid cancer from gene expression data from thyroid nodules. This data set is the same data used in the paper by Alexander. We will develop our own gene expression classifier by applying different featu...

An Improved Semi-Supervised Clustering Algorithm Based on Active Learning

Semi-supervised clustering is one of the major tasks and aims at grouping the data objects into meaningful classes (clusters) such that the similarity of objects within clusters is maximized and the similarity of objects between clusters is minimized. The dataset may sometimes be of a mixed nature, that is, it may consist of both numeric and categorical data. Naturally these two types of...

Fuzzy-Rough Supervised Attribute Clustering Algorithm and Classification of Microarray Data

One of the major tasks with gene expression data is to find groups of coregulated genes whose collective expression is strongly associated with sample categories. In this regard, a new clustering algorithm, termed fuzzy-rough supervised attribute clustering (FRSAC), is proposed to find such groups of genes. The proposed algorithm is based on the theory of fuzzy-rough sets, which directly inc...


Publication date: 2003